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I. Statement of the Problem: 


The purpose of this thesis is to investigate the problems 
associated with information flow in large communication nets, These 
problems appear to have wide application, and yet, little serious 
research has been conducted in this field. The nets under consideration 
consist of nodes, connected to each other by links. The nodes receive, 
sort, store, and transmit massages that enter and leave via the links. 
The links consist of one-way channels, with fixed capacities. Among the 
typical systems which fit this description are the Post Office System, 
telegraph systems, and satellite communication systems. 

A number of interesting and important questions can be asked 
about this system, and it is the purpose of this research to investigate 
the answers to some of these questions. A partial list of such ques- 
tions might be as follows: 

(1) What is the probability density distribution for the 

total time lapse between the initiation and reception of a 

message between any two nodes? In particular, what is the 

expected value of this distribution? - 

{2) Can one discuss the effective channel capacity between 

any two nodes? 

43) Is it possible to predict the transient behavior and 

recovery time of the net under sudden changes in the 

traffic statistics? 

{4) How large should the storage capacity be at each node? 

43) In what way does one arrive at a routing doctrine for 

incoming messages in different nets? In fact, can one state 

some bounds on the optimum performance of the net, independ-' 
eat of the routing doctrine (under some constraint on tha 


set of allowable doctrines)? 


{6) Under what conditions does the net jam up, i.e, 
present cn excessive delay in transmitting messages through 
the net? The solution to this problem will dictate the 
extent to which the capacity of each link can be used (ises, 
the vatio of rate to channel capacity, which is commonly 
known as the utilization factor). r 
(7) What are the effects of such things as additional intra~ 
node delays, and priozity messages? 


One other variable in the system is the amount of information that each 
node has shout Che state of the system (i.e., how long the queues are 
in each other node), it is clear that these are critical questions which 
need ansvers, and it is the intent of this research to answer some of 
than. 

In attempting the solution of some of these problems, it may 
well be that the study of a specific system or application wilt expose 
the basis for an under Standing of the problem, It is anticipated that 


such a study, as well as a simulation of the system on a digital computer , 
vill be undertaken in the course of this research, 


ii, History of the Problem. 


‘The application of Probability Theory to problens of telephone 
represents the earliest area of investigation related to the 


presen communication network problem, ‘he first work in this direction 

dates back to 1997 and 1908 thea E, Johannsen ujt? pubiished two 

essays, the one dealing with delays to incoming calls in a manual tele- 
kone exchange, and the other being an investigation as to how often sub- 


seribevs with one ov move lines are reporsed "busy," Tt was 


Wee Johannsen who encouraged A,X. Erlang to investigate problems of this 


nature, ang, an engineer with the Copenhagen telephone exchange, 
uade a number of major contxibutions to the theory of telephone traffic, 
all of which are translated and reported in [1]; his first paper (on the 


Poissonicn distribution cf 


mocaing calls} appeared in 1999 and the paper 


eoataining the results of his main work vas published in 1917, ia which 
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ah 
he considered the effect of fluctuations of service demands on the util- 
ization of the automatic equipment: in the telephone exchange, 

A few other workers made some contributions in this direction 
around this time, and a good account of the existing theories up to 
1920 is given by O'Dell [2,3]; his principal work on grading appeared 
in 1927, Molina[4,5] was among the writers of that time, many of whom 
were concerned mainly with attempts at proving or disproving Erlang's 
formulas, as well as to modify these formulas, 

The theory of stochastic processes was developed after Erlang's 
work, In fact it was Erlang who first introduced the concept of statis- 
tical equilibrium, and called attention to the study of distributions 
of holding times and of incoming calls. Much of modern queuing theory 
is devoted to the extension of these basic principles with the help of 
more recent mathematical tools. 

In 1928, T.C, Pry {6} published his book (which has since 
become a classic text) in which he offered a fine survey of congestion 
problems, He was the first to unify all previous works up to that time, 
Another prominent writer of that period was C, Palm [7,8], who was the 
first to use generating functions, in studying the formulas of Exlang 
and O'Dell. His works appeared in 1937-19338. During this time, a 
large numbex of specific cases were investigated, using the theories 
already developed, in particular lost call problems, Both Fry and 
Palm formulated equations {now recognized as the Birth and Death 
equations) which provide the foundation for the modern theory of conges- 
tion. 

In 1939, Feller {9} iatroduced the concept of the Birth and 
Death process, and ushered in the modern theory of congestion, His 
application was in physics and biology, but it was clear that the sama 
process characterized many models useful in telephone traffic problems. 
Numerous applications of these equations were made by Palm [10] in 
1943 In 1945, Jensen (see [1]) also used this process, without men- 


tioning its name, for the elucidation of Srlang's work. Kosten [11], 
in 1949, studied the probability of loss by means of generalized Birth 


Sis 
and Death equetions. Waiting line and trunking problems were explained 
by Feller [12] in his widely used book on probability, making use of 
the theory of stochastic processes, At around 193¢, the problems of 
waiting lines and trunking problems in telephone systems were taken up 
more by mathematicians than by telephone engineers, 

In 1950, C.E. Shannon [13] considered the problem of storage 
requirements in telephone exchanges, and concluded that a bound can be 
placed on the size of such storage, by estizating the amount of inforna- 
tion used in making the required connections, In 1951, F. Riordan [13] 
investigated a new method of approach suitable for general stochastic 
processes, R. Syski [15], in 1960, published a fire book in which he 
presented a summary of the theory of congestion anc stochastic proces- 
ses in telephone systems, and also cast some of the more advanced mathe- 
matical descriptions in common engineering terms. 

In the earthy 1950's, it became obvious that many of the results 
obtained in tie field of telephony were applicable in much more general 
situations, and so started investigations into waiting lines of many 
binds, waich has developed into modern Queuing Thecry, itself a branch 
of Operations Research. A great deal of effort has been spent on single 
node facilities, i.e., a system in which "customers" enter, join a queue, 
eventually obtain "service" and upon completion of this service, leave 
the system, P.M, Morse [16] presents a fine introduction to such faci- 
lities in which he defines terms, indicates applications, and outlines 
some of the analytic aspects of the theory, P. Buske [17], in 1956, 
showed that for indepeudent inter-arrival times (i,e., Poisson arrival), 
and exponential distribution of service times, the inter-departure times would 
also be iadepundeat (Poisson). In 1959, F. Foster [18] presented a 
duality principle in which he shows that reversing the roles of input 
(arrivals) end output {service completions) for a system will define a 
dual system very much like the original system, In contrast to the 
abundant supply of papers ‘on single node facilities, relatively few 
works have been published on multi-node facilities (which 1s the area of 


interest to this thesis), Among these papers which have been presented 


~6- 
is one by G.C. Hunt [19] in which he considers sequential arrays of 
waiting lines. We presents a table which gives the maximum utilization 
factor (ratio of average arrival rate to msximm service rate) for which 
steady state probabilities of queve length exist, under various allow~ 
abal queue lengths between various numbers of sequential service facili- 
ties. J.R. Jackson [20], in 1957, published a paper in which he invest i- 
gated networks of waiting lines, His netvork consisted of a number of 
Service facilities into which customers entered both from external sources 
as well as after having completed service in another facility, He proves 
a theorem which stated roughly, says that a steady state distribution for 
the system state exists, as long as the effective utilization factor for 
each facility is less than one, and in fact this distribution takes on a 
form identical to the solution for the single node case.” in 1960, 
R. Prosser [21] offered an approximate analysis of a random routing 
procedure in a commmication net in which he shows that such procedures 
are highly inefticient but extremely stable {i.e., they degrade grace- 
fully under partial failuze of the network). 

The two importent characteristics of the communication nets that 
form the subject of this thesis are (1) the number of nodes in the sys- 
tem is large, and <2) each node is capable of storing messages while they 
wait for transmission channels to become available. As has been pointed 
out, Queuing Theory has directed most of its effort so far, toward 
single node facilities with. etorage. There has been, in addition, a 
considerable investigation into multi-node nets, with no storage capa- 
bilities, mafaly under the titie of Linear Programming (which is really 
a study of linear inequalities and convex sets). This latter research 
considers, in effect, steady state flow in large connected nets, and 
tas yielded some interesting results, One problem which has attracted 
a lot of attention is the shortest route problem {also known as the 
travelling salesman problem). M. Pollack and W, Wiebenson [22] have 


presented a review of the many solutions to this. problem, among which 


a 


ÎJackson's work is discussed in detail in Section IV. 


Peay es 
are Dantzig's Simplex Method, Minty's labelling method, and the 
Moore-D'Esepo method, W, Jewell [23] has also considered this problem 
in some greater generality, and, by ‘using the structure of the network 
and the principle of flew conservation, has extended an algorithm due 
to Ford and Fulkerson in order to solve a varied group of flow problems 
in an efficient manner, Re Chien [24] has given a systematic method 
for the realization of minimum capacity communication nets from their 
required terminal capacity requirements (again considering only nets 
with no storage capabilities); a different solution to the same problem 
has been chtained by Gomory aud Hu [25]. In 1956, Pe Elias, 

Aa Feinstein, and C.E. Shannon [26] showed that the maximum rate of 
flow through a network, betveen any tvo terminals, ig che minimum value 
awong all simple cut-sets, Also, ia 1956, Ze Prihar [27} presented an 
article in which he explored the topological properties of communica~ 
tion nets; for example, he showed matrix methods for finding the num- 
ber of ways to travel between two nodes in a specific number of stepa. 

In 1959, P.AsP. Moran [28] wrote a monograph on the theory of 
storage. The book describes the basic probability problems that arise 
in the theory of storage, paying particular attention to problems of 
inventory, queuing, end dam storage. de represents one of the few 
works pertaining to a system of storage facilities. 

The results from Information Theory [29] also have relation to 
the communication net problem considered here, Most of the work there 
has dealt with communication between tue points, rather than comunica- 
tion within a network. In particular, one of the resulte says that 
there is a trade-off betweea message delay and probability of error in 
the transmitted message. Thus if delays are of ne censequence, trens- 
mission wich an arbitrarily low probability of error can be echieved. 
However, it is not obvious ne yet, what effect such additional intra- 
node delays vould kave in a large network of communication centers; it 
seems that some maxima additional delay exists, and if so, this would 
restrict the use of ceding methods, and perhaps put a non-zero lower 
bound on the error probability. 


ann sex 
TIX. Discussion of Proposed Procedure 


The problems associated with a multi-terminal communication net > 
as posed in the first section, appear to be too difficult for analysis, 
in an exact mathematical form, That is to say, the calculation of the 
joint distribution of traffic flou through a large (or even sual) i 
network is extremely difficult, Even for networks in which no feedback 
is present, the mathematics is unmanageable; and for those vith feedback, 
it seems hopeless to attempt an exact solution, The question, then, is 
to what degree, and in what fashion can we simplify this problem? 

Since it is the complicated interconnections that cause most of 
the trouble, one would like to isolate each node, and perform an indi- 
vidual analysis on it, under some boundary constraints, The node could 
then be represented by the results of such an analysis. In particular, 
‘that the node representation would be sufficiently complete, 
by the`use of perhaps two numbers, these numbers being the mean and 
variance of the traffic handled by the node, Thus, instead of having to 
derive tre complicated joint distribution of the traffic in the network, 
one may be able to make a fairly accurate characterization by specifying 


tuo (or at most a few) parameters. . 

This approach is rot completely naive and without justification. 
Consider the linear programming techniques [22-26] mentioned in the 
second section ef this proposal. The problems handled by such tech- 
niques kave a great deal in common with the communication problem at 
hand, Their problem is that of solving networks in which the commodity 
Llego, water, pecple, information) flows steadily. A typical problem 
would be that of finding the set of solutions (commonly referred to as 


feasible sulutiens) which would support a given traffic flow in a net- 
work, A solution would consist ef specifying tha flew capacity for each 
dink between all pairs of nedes. In general, a large number of solu- 
tions exist, and a lot of effort bas been spent in minimizing the 

total capacity used for such a probleme One obvious requirement is that 
the average traffic entering any node must be less than rhe total capac- 


= 9 = 
ity leaving the node, Notice that the important statistic here is 
the average traffic Slow, and if the flow 4s steady, then we have a, 
deterministic problen, Now, in what way does this problen differ fron 
the problem considered in this proposal? Clearly, the difference is 
that ve co net have a steady flow of traffic. Rather, our traffic cones 
in spurts, according to seme probability distribution, Consequently, ve 


pared to vaste some of our channel capacity, i.@., the chan- 
neh will sometimes be idle. A good measure of how non-steady our traf- 
fic is, is tke variance of the traffic, That is, for zero variance, we 
ave reduced to the special case above, namely, steady flew, As the vari- 
ance goes up, we can say less cnd less about the arrival time, and the 
traffic becomes considerably more vandon in tine, Thus, it is reasongbie 
to expect that the tro inportant parameters which characterize our traf- 
fic ave the mean and vazianse of the flow. Notice that a necessary, but 
clearly net sufficient condition Sor a feasible solution to eur problem 
is that the average traffic entering the node must ke less than the total 
capacity of channels leaving the node. In 1951, Xandali [30] shoved for 
a single node with Polsson intes-arvivel tines (at a rate N per sec), an 
infinite allavable queue, aud an arbitrary service distribution (with 
mean Ia and + 
Bz) was 


tance v), that che expected waiting time in the syscen, 


i Ug + 
E = OND + S EDT 


This clearly shows a liuesr dependence on the variance. 

Reference has alveady baen wade to J.B. Jackson [20], The 
asguanptions that he mada in his arelysis of aetworks of waiting lines was 
that the axriviag traffic at each node Lad a Poisson distribution, that 


the service time was exponentially distributed, and that infinite queues 


at each node were allowed. With these assumptions, he was able to deriva 
the distribution of traffic at all the nodes. it is important te snalyze 


6 assumptions carefully. The Paisson assumption effectively 


> Jü- 
characterized the traffic with tuo parameters, This same assumption 


also effectively decoupled the nodes from each other, His results state 
that if the mean traffic satisfiea the necessary condition stated in the 
| previous paragraph, then the resultant traffic can be characterized by a 
tvo parameter description, 

| The question of queuing discipline is an interesting one, and 
one which causes some difficulties, That is, the nede must decide on a 
method for choosing some member in the queue to be served next. An 
interesting simplicatcion tothis question, and perhaps a key to the solu- 
tion of the network problem may be obtained as follows, Consider that 
class of queue disciplines which require that a channel facility never 


remain idle, as iong as the queue is non-empty (clearly this is a reason- 


able constraint), Mow, adopting a macroscopic viewooint, (1.@., renov- 
ing all labels from the members in the queue), what can be said about 
| the mean and variance of the waiting time distribution for this class of 
queue disciplines? Ic seems that some reasonably tight bounds might 
exist for this distribution, independent of the particular discipline 


used, Perhaps some other restriction on the class of disciplines will be 


required in order to obtain meaningful results, Hawever, under such a 
set of restrictions, if we can characterize the queue sufficiently well, 
| we may then be placed in a position to obtain some overall behavior for 
the networke All of the queuing problems solved to date, have considered 
a particular queue discipline (she microscopic viewpoint), and eo the 
results have been specialized to an extrenely lerge degree, Adopting 
the macroscopic viewpoint seems to be a natural step, and it is the 


intention of this research to investigate this avenue. 


| There appear to be 2 number of conflicting interests in a network 
| of this type, The things to be considered are: storage capacity at 

| 

| 


@ach node; channel capacity et each node; and message delay, There 


sts a trading relationship atong these quantities, and it is neces- 


sary to attach some quantitative measures to this trade, In fact, if 


one wishes to generalize one step further, ene can consider a multi- 


terminal communication system, in wa e signals are perturbed by 


š -il « 
noise in the system, Information theory tells us how to combat this 
disturbance, and the solution introduces additional delays in message 
transmission and reception, What effect these additional delays will 
have on the system is not clear; in fact it becomes difficult to state 
just what the overall capacity is for such a situation, Questions such 
as these are extremely important, and deserve attention. 

From the statements in this section, it is clear that an 
approximate analysis is all that can easily be obtained fer the network 
under consideration, Hopefully, the approximate answers will be reason- 
ably useful, One way to check the utilization of the results is sinu- 
lation, it is fully expected that, in the course of this research, a 
simulated net of this type will be programmed on one of the local digital 
computers, The author has access to the Lincoln laboratory TX-2 
computer, as ‘well as the IBM 709, This simulation study should serve as 
a useful ‘check onthe results and perhaps, will also serve as a guide 


into the research, 


